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1. Introduction 



Over the last decade, there has been increasing interest in how network heterogeneity may affect nonequi- 
librium dynamics in qualitative ways [2]. One of the simplest and most important examples has been the 
susceptible to infected (SI) and suspectible to infected to removed (SIR) epidemic models, famous from 
epidemiology [22J, which model disease outbreaks in populations. A decade ago, [28] first demonstrated 
that heterogeneous networks can fundamentally alter the dynamics of these processes in qualitative ways 
- in particular, the epidemic threshold vanishes on scale free graphs of degree 7 < 3, so epidemics always 
infect a nontrivial fraction of nodes on an infinite graph, with finite size corrections later shown to be 
extremely small [27] Q Later, in [1] it was shown that the removal of this threshold also corresponds to 
faster than linear epidemic growth on such graphs. Many authors have all explored various aspects of 
the dynamics of epidemic spreading. [61 El HJ [T9j [8] extend analysis of mean field theory, and within this 
framework |14|. 124] discuss the late time behavior of epidemics on scale free graphs, with [131125] introduc- 
ing some dynamical aspects. [291 E2]> closest in spirit to this work, present reductions of the dynamical 
equations, although their approach is quite different. Mathematicians have used many complicated tech- 
niques to obtain information about generalizations of such solutions to more complicated epidemic types 
[12\ I26j. but have typically avoided studying the complication of adding an entire network structure. [9 J 
introduces an extension where bipartite graph structure can be reasonably accounted for by mean field 
theory, leading to a model of sexually transmitted disease (STD) epidemics. 

In addition to processes which may be well modeled by the SIR epidemic, there are many others 
which share the same structure of the SIR epidemic - irreversible flow from S — > I — > R. A related 
example of such a process is that of rumor spreading |21|, [20] , which is similar to that of SIR epidemic 
spreading but with a "death rate" which is proportional to the current number of infected edges. A 
slightly more complicated version of the model also allows for the infected nodes to die on their own [23], 
but the fundamental difference between this model and the epidemic is captured without this term. Other 
irreversible processes, such as a new model for recommendation spreading in a population [5], are also 
very similar, even if they do not have an identical S — > I — > R structure. 

In this paper, we will present mean field dynamical solutions to the following 4 models: the SIR 
epidemic, the SI epidemic on bipartite graphs, a simplified model of rumor spreading in which only 
infected edges can induce transitions to the removed state, and the recommendation spreading model. 
These solutions should, for all intents and purposes, be regarded as exact - the only approximation that 
they require is mean field theory, and they allow for reconstruction of all dynamical quantities of interest 
within the scope of mean field theory (most easily by numerical methods). For each model, the exact 
solution can be found for arbitrary degree distribution, when written in the form of an integral over a 
function defined based on the degree distribution of the underlying network. We will typically make 
some simplifying approximations to reduce the amount of work we have to do in analyzing the theoretical 
dynamics, but we stress that these approximations can be removed. 

There are numerous reasons why the existence of such exact mean field solutions for arbitrary (mean 
field) networks is helpful. Other exact solutions have typically either focused only on the behavior at very 
late times [24], or focused on very special types of graphs like the nearest neighbor ID lattice [30] . or 
expressed as series solutions, which obscure the physical meaning of the solution [lOj . Most importantly, 
the exact solution allows one to determine the accuracy of mean field theory, beyond a comparison of 
scaling behaviors. Furthermore, an exact solution provides dynamical information about the nature of the 
epidemic away from the fixed points of the dynamics, as well as precise information about the dynamics 
in regimes where linearized approximations break down, and we will indeed find more precise answers 

1 In this paper, we will often casually say "no epidemic threshold" when we are really referring to epidemic thresholds 
which vanish rapidly with N, the size of the network. 
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than we have found in the literature. We will present a basic analysis of the resulting equations as well 
as compare our results to numerical simulations, which are typically quite accurate. For simplicity, we 
will almost always work with scale free graphs, where the exact solution can be expressed in terms of 
integrals over incomplete T functions with well understood properties - furthermore, such graphs capture 
the essence of how network structure can dramatically change the qualitative dynamics. 

The paper is organized as follows. Section [2] discusses the epidemic models, while Section [3] describes 
the rumor spreading models and Section 0] discusses the recommendation model; Section [5] presents a 
discussion of the work. Numerical results are presented as we discuss the theory. 

As this work was being finalized, we discovered a recent series of papers [16} [T71 [T8] which discuss 
modeling variations of the SIR epidemic by reduction of the dynamics to finite sets of ODEs, using 
a technique somewhat related to ours. The focus of this work is quite different, emphasizing scaling 
behavior and asymptotic dynamics, as well applying this technique to models beyond the scope of epidemic 
spreading. 



2. SIR Epidemics 



We begin by discussing the exact mean field solutions, and numerical corroborations of these solutions, 
for the epidemic spreading models. We first discuss the general structure of an "epidemic like" process, 
then move on to the SIR epidemic, then describe why the irreversibility is so crucial, and finally discuss 
the SI STD epidemic. 

2.1. General Overview of Epidemic Processes on Networks 

This section is meant as a brief review of the nature of an epidemic-like process on a network, and the 
well-versed reader may happily skip it or skim it to ensure that he understands our notation. 

We begin by quickly reviewing what we mean by a network, or graph. An (undirected) graph is 
a set of vertices V, along with a set of edges E, with an edge e G E associated to a pair of vertices: 
e = (uv) = (vu) with u,v £ V. The degree of a vertex (or node) v, which we will label k v , is the number 
of edges in E with one of the ends of the edge being v. The SIR epidemic is a stochastic process defined 
on such a network. The state space for this stochastic process is given by {S,I, R}'^ - i.e., each node 
can exist in state S, I or R. In theory, the SIR epidemic is a continuous stochastic process, with the rate 
of transition between states being defined as follows: if two graph configurations differ by more than 1 
node, then no transitions are allowed. If the graphs differ by one node, than the following transitions are 
allowed: 

„ , , _ f u:S->I with rate k v 9 v , . 

for each node v € V : < . _ .,, . , 1) 

(_ v : I — > R with rate A v ' 

where 

_ number of edges which point from v to a node in state I 
Vv = 7 (2) 

Note that we have chosen to measure time in units where the rate of transition from S to I is 1, per edge. 

The intuition for the above process is straightforward. If a node is an S, it is susceptible to becoming 
infected, which occurs by an interaction with an infected neighbor. The more infected neighbors the node 
has, the more likely the node is to catch the infection from one of them - we assume this rate is linear. We 
then assume that a node dies with a constant rate once they catch the disease. There are many obvious 
variations on such a process, although most of them will not be likely to have an exact solution of the 
type found in this paper. We will consider a few simple processes of this form which do have such exact 
solutions. 
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It is well-known that mean field theory is typically a far better approximation to dynamical processes 
on such networks than on a graph like a hypercubic lattice, as the random structure of the graph, and 
the large number of edges, mean that the network itself helps to "average" over states [2j. In this paper, 
we will always assume that \V\ — )■ oo (the number of nodes is getting infinitely large) - this is the regime 
where mean field theory should work best. Mean field theory will treat all nodes with the same k v as being 
the same, and so all we will care about is pk, the fraction of nodes in V which have k v = k, and Sk, Ik 
and R k , the fraction of nodes which have k edges which are in state S, I, or R respectively. Conservation 
of probability tells us that 

S k + h + Rk = l (3) 
and so we can neglect the dynamics of Rk- The other key approximation of mean field theory will be that 

v = d=\£ k Pk ] 1 £ k Pk I k = (4) 

where we are using angle brackets to denote averages with respect to the distribution 
2.2. Solution for Scale Free Graphs 

The mean field equations of the SIR epidemic are easy to write down, given the rules above: 

S k = -kOS k , (5a) 
i k = kes k - AJ fc . (5b) 

Now, let us reduce this infinite set of dynamical equations, assuming that all nodes in the graph have at 
least m edges. We begin with (f5al) : 

Sk _ dff fc _ -kOSk _ k^Sk_ , s 

S m dS m -m6S m mS m 

This can be easily integrated to give, if we assume that <Sfc(0) ~ S m (0) « 1: 

S k (t) = S m {t) k ' m . (7) 
For later convenience, we will introduce the variable 

z(t) = - log S m (t), (8) 

and we find we have reduced (15al) to 



i = m6. (9) 

As we show in Figure [TJ numerical simulations suggest that ([7D becomes very quickly quantitatively true 
for a decent range of k as soon as the epidemic takes off. We use scale free graphs, with 

p k ~ @(k - m)AT 7 (10) 

for simulations for the entirety of this paper, as that is where the dynamics becomes most interesting, 
and where our mean field solutions will become easier to write down. In all of our simulations, we use 



2 (J4j| is a bit simplistic, because since every infected node (other than a starting "seed" infected node) was infected by 
contact with some other infected node, in reality an infected node with k edges could at most transmit the infection to ft — 1 
other states. However, we will only simulate things on graphs where each node has at least 5 or so edges, and this will not 
turn out to have a very large qualitative, or quantitative, impact on the discussion. It is also be very straightforward to 
remove this approximation, at the expense of introducing some more terms into the equations. 
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m = 100 To generate quality scale free graphs, we use the preferential attachment algorithms of [11] Fl For 
a bit larger m, the values of z become significantly higher, but this is a numerical fragment (— logO = oo 
- i.e., all nodes of a given connectivity have been infected or removed), and so we have truncated these 
unphysical values from our graph. 




Figure 1: — log S k as a function of k at various times. We generated scale free graphs with N 
nodes, degree 7 = 3.5, and death rate A = 9, and averaged over 200 trials. 



5000 



Now, we turn to (I5bj) . and we find an equation for 9: 



(k) 



J2jt } [^s k -xki k ] 



Now, using that = e kz / m ; we find that: 



_d9 _ 

z dz m{k) 



1 \ " , 2 -kz/m 



A_ 

m ' 



which implies that 



-kz/m 



m 



(11) 



(12) 



(13) 



Now, using (|10p . let us approximate that our graph is scale free. This will turn out to make 6(z) have 
(approximately) an exact expression in terms of well-understood functions: 



Xz , 
+ 1 

m 



dk 



(7 — l)m 7 1 



7-1 



-m 



00 

-(7-2) J dx e~ 



.7-2 

. 1 /Z\7-l 



Z \X 



ke -kz/m 



-( 7 -2)z^ 2 r(2-7,. 



(14) 



3 We checked that this assumption did not lead to any qualitative changes in behavior - e.g., if m = 5 or m = 20, the 
dynamics are very similar. 

4 Other papers, e.g. [4], show that the specific algorithm used to generate a scale free graph does not result in any 
qualitative change to the dynamics, so we will not worry about this point. 
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Note that the 7 and m dependent factors we have introduced are so that the probability distributions 
integrate to 1. We have also used identities out of [T]: here T(a,z) is the upper incomplete T function. 
Using another identity we find 

6(z) = z~ f ~ 2 T{3-j,z) + l-e- z --z. (15) 

m 

We then find that we have reduced the dynamics, under fairly benign approximations, to a very simple 
form: 

i = mz T- 2 r(3 - 7, z) + m (1 - e~ z ) - Xz. (16) 
We can thus write down the exact mean field solution, (within our mild approximations): 

dz' 

(17) 



m(l - e- z ' + z'^- 2 )r(3 - 7, z')) - Xz' 



*(0) 

Note that we require a very small z(0) factor to regularize divergences - we will discuss the physical 
consequences of this shortly. The physical meaning of this factor, as the initial condition of the dynamics, 
is clear. We should also note that by simply replacing the denominator of (|17h with m8(z), we have the 
exact solution for an arbitrary graph. 

While we have an exact solution, since it involves an integral, it is easier to just analyze (|16p . It is 
straightforward to justify by considering the asymptotic behaviors of the various terms that there are at 
most two fixed points: z = is always a fixed point, and if it is unstable, there is an absolutely stable 
fixed point at z = z* > some finite point. To analyze the stability of the z = fixed point, about which 
dynamics occur, we re-write (|16p as z — > 



r(3-7,*) , ; 

m 5 h m — X 

z 3- 7 



(18) 



Suppose that 7 > 3. Using yet another identity from [I] concerning the small z behavior of the T function 
term, we find that 

>-2 



.7-3 

which implies the existence of an epidemic threshold: 



-m — X 



(19) 



t — 2 

A c = ^^m. (20) 
7-3 

For A < A c , epidemics will not spread, whereas they will for A > A c . Since the fixed point at finite positive 
z is always absolutely stable, we conclude that for A / A c , the dynamics are always linear near fixed 
points. Since these are the slow points of the dynamics, we conclude that the time scales of the dynamics, 
the spreading time T spre ad) and the ending time r en d, should be 

O(l) 
f dz 

^spread ~ ^end ~ / — ~ log iV. (21) 
l/N 

Of course, we do not take our precise approximation of A c too seriously, but the key point is simply that 
there is an epidemic threshold, and a finite time scale of the epidemic dynamics, when 7 > 3. This fact 
is well known [3]. 
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Now, let us consider the case where 7 < 3. Here, the T function ratio is now divergent as z — > 0, and 
so the dominant term of the dynamics is 

z ~ z^- 2 . (22) 



^I.n-a.l« / ^ ~ ^ffi = 0(1). (23) 



From this we find the spreading time scale is 

O(i) 

In the case of 7 = 3, we have that T(0, z) ~ — log z, and so denoting 

y = -logz, (24) 
we find that we can approximate the dynamical equation by 

y ~ -y (25) 

for large y, with initial condition yo ~ log iV. This immediately gives us that 

Spread ~ log log N. (26) 

It was argued heuristically, and shown numerically, in [3] that the growth of epidemics was faster than 
linear for scale free graphs with 7 < 3. Here, however, we have a more precise claim that the time scale 
of epidemic spreading is in fact independent of the size of the network (except in the special case 7 = 3). 
We similarly find for this case that r en d ~ log N. 

Now that we have an exact solution and understand its important properties, the most important 
question is whether or not we can use the exact solution to actually determine the dynamics of various 
functions of interest: Sk(t), ifc(i) and Rk(t). Of course, it will suffice to find the first two, and the first 
follows directly from ([8]) and ([7]). To find ifc(i), we can use the following trick: 



d_ 

df 



I k (t)) = ke- kz(t)/m 8(z{t)). (27) 



Having found z(t), we can recover 

I fc (t) = / ds e- x{ - t - s he- kz ^l m 6{z{s)). (28) 



where we have approximated that ifc(O) ~ 0. It is likely not possible to do these integrals by hand, but 
they could be done numerically. 

Figure [2] compares the equation (I16p . the result of mean field theory, to numerical simulations. We 
see that the qualitative sketch of the mean field trajectory is reproduced by the simulated dynamics for 
the range of N tested, but quantitatively the curves appear shifted a bit, which is expected due to some 
of our approximations. Interestingly, we see that for 7 = 3.5, the mean field theory slightly lags behind 
the simulations, whereas for 7 = 2.5, the mean field theory leads the simulated dynamics. This suggests, 
perhaps, that the sharp transition observed in mean field theory between 7 > 3 and 7 < 3 is likely not 
quite as sharp in the actual dynamics on a network^ 



5 Another issue is that N = 2000 may be far too small to see a difference, but we did not have the computing power 
available to test this. 
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Figure 2: Comparison of mean field theory prediction for z(t) to numerical simulations. We used N = 
2000, m = 10 and averaged over 50 trials. The unphysical jumps in A = 2 dynamics are due to trials 
where all nodes with k = 10 became infected. 



It is not hard to understand qualitatively what will happen if we assume that p k does not describe a 
scale free network. In this case, we will no longer have an explicit form for the answer, but we can still 
understand the qualitative behavior by studying the quantity 

C( 7 ) = ton 

K— YOO 

We can use the divergences in C(j) to bound the dynamics on our given graph by replacing the graph's 
degree distribution with a non-normalized p^ ~ fc~ 7 , to find bounds in z ~ 9{z). Crudely speaking 
(7(7) ~ k^pk for large k, but to take care of a network where some of the pk may be 0, we will use the 
above definition. If (7(3 — e) = 00 for some e > 0, then we conclude that r sprcac j ~ 0(1). The case where 
(7(3) < 00 but (7(3 + e) = 00 for any e > implies that r sprea d ~ log log N, which we obtain by bounding 
the spread time both from below and above by bounding 9{z) by two scale free distributions of degree 
7 = 3. If (7(3 + e) < 00, then we conclude that T spre ad ~ log N. For this last case, there is an epidemic 
threshold independent of N, while for the former cases, there is only an epidemic threshold vanishing as 
N 00. 

2.3. SIS Epidemic? 

A natural question to ask, given our success with mean field theory above, is whether or not we can do 
something for the SIS epidemic. In the SIS epidemic, instead of dying (transitioning to state R), nodes 
transition to state S with rate A. The mean field equations in this case are given by [28J: 

4 = k9(l - I k ) - XI k . (30) 

Numerous problems arise in this case. One of the major problems is that since it is possible to become 
susceptible again, we do not have the simple reduction of the S dynamics to a single equation. The second, 
critical, problem is that 9 is not proportional to 9 - instead, we get a "tower" of dynamical equations for 
the probability of looking at an infected node weighted by k 2 , k s , etc. This implies that the irreversibility 
of the SIR epidemic is crucial for the exact solutions found above. 



k — m — 1 z — ' 

n=m 



k 

^ n'pn 



(29) 
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2.4. STD Epidemics on Scale Free Bipartite Graphs 

A natural extension of the above discussion is the STD epidemic model on bipartite scale free graphs, as 
introduced in [9] l£| The basic idea of this model is that there are two networks, a "male" network and a 
"female" network, such that all edges are between a male and female. The mean field theory we used in 
the previous parts would be a bad approximation here, because we do have two distinct types of nodes, but 
at the expense of doubling the number of dynamical variables to Sm/o Sfa:> ^Mfc and Ip^, referring to the 
probability that a male/female node is susceptible and male/female node is infected respectively, we can 
correct for this. For simplicity, let us assume that the male graph is scale free of degree 7m, and the female 
graph is scale free of degree jp. The extension of the mean field equations above is straightforwardly 

SFk = —kOuSFk, (31a) 
<SMfc = -k9 F Suk, (31b) 
I F k = hOyiS-ph — A/pfe; (31c) 

-^MA; = kOpSyik ~ AI]yifc, (31d) 

with 9 F and 9m defined in the same way as before: 

9 F = — — S2 k PFkhk, (32a) 



(*>F 



By defining z F and Zm as before: 



0F = Trr~ k PMklMk- (32b) 
(«)m ^ 



z F = -logS Fm , (33a) 
z M = - logSWi, (33b) 



we find, using the same tricks as above, 



z F = uiOm, (34a) 

zm = m9 F , (34b) 

9 F = (tf - 2)mz 7F " 3 r(3 - 7F , z F )9 M - X9 F , (34c) 

= (7M - 2)m^ M ~ 3 r(3 - 7 M, z M )e F - X9 M . (34d) 

We have not found a way to solve these equations nearly exactly. The difficulty comes in via the 
mixing of #f and 9m, which render the division trick we used earlier useless. However, we can solve a 
simplified version of the model. Consider the case where A = - this should be a decent approximation 
to the case where A < 1 anyways (so the epidemic spreads very rapidly), and should give us qualitative 
insight into the nature of spreading. In this case, we can once again employ the division trick, and we 
find that, just as before, using identities out of [I]: 

#f(^f) = ^ F " 2 r(3 - 7F , z F ) + 1 - e" 2F = 1 - ( 7F - 2)z^" 2 r(2 - 7F , z), (35a) 
M^m) = ^M M ~ 2r ( 3 " 7m,^m) + 1 - e" ZM = 1 - ( 7 m - 2)z^ 2 T(2 - lM ,z). (35b) 



6 Actually, this paper considered the SIS epidemic. But as we just mentioned, the SIS epidemic does not have a nice 
solution - at least not using our techniques. 



7 We assume that the rates are not M/F dependent, for simplicity, as was done in [S]. 
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Now, we use that 

to find that 
where 



Returning to our assumption that the graphs are scale free: 

- 2 + z^ 1 r(2- 7 ,z)] . 



z F _ dz F _ 6> M 
z M dzM Of 

F(z F ;~/ F ) = F(z M }Ju) 

z 

F{z) = [ dz'e(z'). 



7 



(36) 
(37) 

(38) 
(39) 



Now, to understand (|3T[) in the regime of interest (for small z), we perform asymptotic expansions on F. 
We find that the lowest order non vanishing terms are given by 



F(z;<y) 



7 



2 7 - 3) 
z i 1 

7-1 



7 > 3 

7 = 3 
2 < 7 < 3 



(40) 



Let us look at a few examples of what this implies about the dynamics as the epidemic gets started. 
Suppose that 7f > 3 and 7m > 3. It is easy to see that ((371) implies that 



z F 



'(7f-3)(tm-2) 
(7f - 2)(tm - 3) 



ZM, 



or 



q _ Q V(7F-3)(7M-2)/(7F-2)( 7 M-3) 



(41) 



(42) 



We should not take the precise exponent here particularly seriously, but just note that the fraction of 
male susceptible nodes is some power of the fraction of female susceptible nodes. Now, let us consider 
the case where 7f > 3 but 7m < 3. Then we find 



z F 



' 2(tf - 3)r(3 - 7m) ( 7 m-i)/2 
(7f-2)(tm-1) M 



(43) 



This is a surprising result - for very small t, the female nodes get infected at a rate more than exponentially 
faster than to the male nodes, although this range of times is not very long. 

We can also see quickly that a similar result for r sprca( j holds: if 7m>7f > 3, the spreading dynamics 
are 0(log N); they are 0(1) in the case of 7m < 3. In the case of 7m, 7f > 3, this follows from ([IT]) and 
(USD: 

z F = muyi ~ m -zyi = m\ 



'(7M-2)( 7F -2) 



7M 



(7M - 3)(tf - 3) 



z F . 



and similarly for zm- In the case of 7m < 3, 7f > 3, we have instead, using (|43j) and (|3ol) : 

^ ~ ft,, ~ _ 2( 7m -2)/(7m-1) 

Z F ~ UM Zy[ Z F 



(44) 



(45) 
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Since 



< 2 



7M 



< 1 (2 < 7 M < 3) 



(46) 



7m - 1 

we conclude that growth is faster than linear, and that the spreading dynamics is 0(1) for the same reason 



as in the SIR epidemic. In the case of 7f > 3, 7m 



3, we find that since z\ ~ 



log zm, that 



z F = z M log — « ZFi/log — + 0(log log Z F ) 

Z M V ZF 



Defining y-p 



log zp as we did earlier, we find that 

''"spread 



v/logiV 



(47) 



(48) 



In the case of 7m = 7f = 3, we can find that r spre ad ~ log log N just as before. 

To generate bipartite scale free networks for use in simulations, we used a similar algorithm to what is 
used in [SJ, which unfortunately does not guarantee that all F nodes have at least 10 edges. However, we 
see that this does not significantly ruin the dynamics, and they match mean field theory extremely well, 
as shown in Figure HI although they are a bit lower than mean field theory would predict in the range 
of validity. Figure [3] shows that the fraction of susceptible nodes (for both M and F) is exponentially 
decaying with k, as mean field theory predicts. Together, these suggest that mean field theory is a valid 
dynamical approximation at all times, notwithstanding finite size limitations. 



3 

2 - 



log S k 



- 




□ female, 7m = 2.5 

• male, 7m = 2.5 

□ female, 7m = 3.5 

• male, 7m = 3.5 



10 



15 20 

k 



25 30 



35 



Figure 3: — log 5m*; and — logSpfc on SI STD epidemics on graphs with N = 5000 nodes and 7f = 3.5, 
averaged over 400 trials. We used times t = 0.24 and 0.32 for 7m = 3.5, and 0.24 for 7m = 2.5, to avoid 
finite size effects (which become visible for the blue lines), as discussed earlier. We have checked that 
other parameters lead to similar linear relations. 



3. Rumor Spreading 



Now, let us turn the discussion to models of rumor spreading. The essential idea of the rumor spreading 
model is that people can be described as either unaware of the rumor (state S), actively spreading the 
rumor (state I), and not actively spreading the rumor, and having heard of it (state R). The key difference 
with the SIR epidemic is that the death rates will now change. 

There are 2 possibilities. The classic rumor spreading model, which we will denote "type IR" rumor 
spreading, corresponds to a situation where every edge that connects a given node in state I to a state in 
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Figure 4: Comparison of z(t) between theory (solid line) and simulations (dotted line) for the SI STD 
epidemic model. We used N = 2000, m = 10, and 100 trials. 



either I or R induces transitions to R with rate A. We will instead consider a simplified version, which we 
denote "type I" rumor spreading, where only I nodes induce such transitions. Type I rumor spreading is 
perhaps not as realistic as type IR, for dynamical reasons which will become clear, but it will admit an 
exact solution of the same type as we have found before, so we will focus our discussion on this model. 
First, we begin by discussing type IR rumor spreading, and describe what can be obtained from mean 
field theory. 

3.1. Type IR Rumor Spreading 

Let us define 

V> = £^. (49) 

The mean field equations are 

Sk = —kOSk, (50a) 

i k = kes k - \k(i - i>)i k . (50b) 

We will not find a way to nearly exactly solve the above equations, even for a scale free graph. Furthermore, 
essentially all of the results we find in this section can be found in [23], but we repeat them here for 
completeness, and because we derive them in a slightly quicker way. We begin by noting that introducing 
z as we did before, we find the exact same relation that Sk = e~ kz / m . In particular, this means that (once 
again, for simplicity, assuming pf. ~ k~ y ) 

oo 

= E = £ ^e-W™ « 2_1 J dk e -*/m = ( 7 _ 2)Z ^ T (2 - 7 , *). (51) 

m 

In general, we can find an expression for ip(z) for more complicated degree distributions, but we may not 
be able to find the exact solution. Given tp(z), (j50b|) becomes 

i k = k6e- kz/m - \k(l - ip(z))I k . (52) 

Unfortunately, it is far from obvious how to solve these differential equations exactly. Although 
they are linear in /, they involve diagonalizing a nontrivial infinite dimensional matrix. We will content 
ourselves to merely understanding the location of the fixed point z* . To find z* , we note that 

^(1) = E P*** = E k Pk^ kz/m 6 - A(l - m) E k P^k = <*>[<¥(*) - A0(1 - rp(z))]. (53) 
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At t — > oo, this should go to 0, so we conclude that 

*W = XTT- (54) 

We can say more about the state of the graph at the fixed point: the mean field theory clearly predicts 
that Sk(oo) decreases exponentially with k. This fact was known to [20], but a theoretical reason was not 
known. 

Since the focus of this paper is on discovering exact solutions, let us now turn to type I rumor spreading, 
which we will discover does have an exact solution. 



3.2. Type I Rumor Spreading 

Let us now turn to the simplified model of type I rumor spreading, with mean field equations 

Sk = —kOSk, (55a) 
4 = k9Sk - Xk6I k . (55b) 

It is clear that Sk = e~ kz / m as before. We now may exploit a different trick than the one we have 
previously used. Consider 

h_ = dh_ = _ 1 + 

Sk dSfc Sk 

Then we see that by defining 

w k S k = h, (57) 
([56]) becomes, assuming for simplicity that A ^ lH 

5^ = -l + (A-lH, (58) 
a&k 

which for appropriate initial conditions, implies 

1 log ^-^-^ logfr, (59) 



A-l ° -1 
or 

i , . p—Xkz/m p — kz/m 

h = — {Si - S k ) = — . (60) 

Now, from here, we can directly compute 9{z). As we expect, 9{z) has an explicit expression for a scale 
free graph under the sum to integral approximation: 

oo 

^dfc(^) 7 Q - = l-i[(Azr- 2 r(2- 7 ,Az)-^- 2 r(2- 7 ,.)] (61) 



and therefore obtain 



z = m9 = J — lm [(Az) 7 ~ 2 r(2 - 7, Xz) - z 7 " 2 r(2 - 7, z)] . (62) 
1 — A 



Using r function identities we can re-write this expression: 



e ~ Xz - e~ z - (Az)^ 2 r(3 - 7, Ag) + z^ 2 T(3 - 7, z) 

z = m (63) 

1 — A 



The case of A = 1 is not difficult to solve, but we do not present it in this paper. 
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Just as before, we can find the exact solution by finding t in terms of z, expressed as an integral. 
Interestingly, we should note that for type I rumor spreading it is actually far easier to extract the 
relevant physical information: Sk and Ik, than for the SIR epidemic. Determining Sk is the same as for 
the epidemics, but this time we can simply read off Ik from f)60f) . 

Let's analyze the behavior of this equation for small z. When 7 > 3, we use the asymptotic expansions 
for z ~ 0: 

-Az + z-(3-7)~ 1 (Az-z) 7-2 f . 

z « m = mz, 64 

1-A 7-3 ' v ; 

which is precisely what we would have found had we naively assumed that the short time behavior of 
the rumor spreading was behaving like a SIR epidemic with effective death rate of 0. Our intuition 
thus implies that we should have expected the absence of an epidemic threshold, which is indeed what 
we see. However, the intuition of approximating rumor spreading as an epidemic fails for the case of 
7 < 3, interestingly, where the dominant asymptotic behavior near the origin comes exclusively from the 
r functions: 2 

i^mr(3- 7 ) 1 ~ A7 . z 7 " 2 . (65) 
1 — A 

Here, interestingly, we see that the death rate has an effect on the short time dynamics even for small z: 
the A dependent factor behaves like 1 for A< 1, and A - ( 3 ~ 7 ) for A > 1 (as expected, higher death rates 
suppress the growth of the epidemic) . We also note that it is obvious from here that T S p rea d has the same 
scaling behavior as with the SIR epidemic: O(logiV) when 7 > 3, 0(loglog N) when 7 = 3, and 0(1) for 
7 < 3. 

Figure [5] shows plots of the simulated rumor spreading, compared to mean field theory. We see that 
at initial times, mean field theory is an excellent approximation, although it begins to significantly break 
down at large z. The reason for this will be explained in the next subsection. Figure [6] shows that Sf. is 
still exponentially decaying with k for the rumor spreading models. While for earlier times, the optimal 
linear fit requires a nonzero intercept with the z axis, the qualitative picture of mean field theory holds 
very well. 
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Figure 5: Comparison of z(t) between theory (solid line) and simulations (dotted line) for type I rumor 
spreading. We used N = 2000, m = 10 and averaged over 50 trials. It required a time step of At ~ 0.01 
before the simulation appeared to accurately reflect continuous time dynamics. 



3.3. Late Time Type I Dynamics 

The above discussion focuses on the early time dynamics. For late times, we will see that type I rumor 
spreading is a simple example of a process where we should expect mean field theory to completely break 
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Figure 6: — log as a function of k at various times. Here we show the example of growth on a scale free 
graph of degree 7 = 3.5 with N = 5000 nodes and death rate A = 4, averaged over 200 trials. 



down, something which we observed in Figure [5j 

Let us begin by naively assuming that mean field theory is an accurate description, and see what we 
find. Proceeding as before: 



■ ^7-2 

z ~ -m 

1 — A 

This implies that, letting A = min(l, A), 



e" Az 



?y — O p -min(l,A)z 

|1 — A|m min(l, X)z 



Tend~ J dz Aze Az ~ Az*e Az * . (67) 
O(l) 

Now, we have to be careful about z*. In the type I rumor spreading, once all of an infected node's 
neighbors die, he will stay infected forever. Suppose we are on a fully connected graph - then it is clear 
that z* = -log(l/A0 = log A^, and thus 

r end ~iV A logiV. (68) 

This is a very interesting and strange result - the time scale itself of the epidemic ending is extremely 
sensitive on the parameters of the problem, until the critical point when A = 1, in which case, roughly 
speaking, the epidemic spreads by pairs becoming infected, with one of the two quickly dying off. 

This expression for r en( j is completely incorrect, however, for a graph which is not fully connected. 
Here, it becomes a little bit subtle to determine the correct z* . The basic intuition we have proceeds as 
follows. Typically, the more connected a node was, the more likely it was to have gotten infected early, 
and to have died quickly. Therefore, the nodes which survive are the ones with fewer connections. Now, 
let us consider for simplicity, only the nodes which have on the order of the fewest connections, m. If we 
choose a node to "live" and kill all of its neighbors, repeating this process until we have saved or killed all 
nodes, then, since we expect to kill ~ m nodes each time, we should expect that s m ~ m _1 , or z* ~ logm. 

However, if the dynamics is driven to a fixed point at z* ~ logm, then we know that the mean field 
theory description must have completely broken down, since there is no fixed point for finite z. The naive 
guess is that since the fixed point occurs at z* = 0(1), the fixed point is absolutely stable, and therefore 
Tend ~ log N. We can qualitatively see this result holds up against numerical simulations, shown in Figure 
[7J Interestingly, we see that the dynamics ends fastest when A ~ 1, and becomes slower both for large 
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and small A. This has an intuitive interpretation - for A <C 1, the ending dynamics is slow because we 
are waiting for death events, which take a very long time; for A > 1, the ending dynamics is slow because 
deaths occur so fast that the rumor /infection must propagate "one node at a time" with a creation of an 
I-I edge quickly followed by one of the two dying. 



7"end 




100 200 400 800 1600 3200 6400 
N (logarithmic plot) 



Figure 7: The ending time, averaged over 50 trials, on scale free graphs with 7 = 3.5. We can see that 
tend ~ log N. To speed up simulations, we used fairly large time steps - we do not think this should alter 
the qualitative nature of the end time dynamics, although this may make our simulated r en d too small. 



4. Recommendation Spreading 



We now show that a very recently proposed model for recommendation in social systems [5] also has 
an exact solution in terms of an integral, just as we found above. In this model, there are 3 states: a 
susceptible node (S), an accepting node (A), and a denying node (D). Instead of SIR-type dynamics, the 
dynamics of this model are as follows: if an S comes in contact with an A, it will transition to an A with 
rate 1, and a D with rate A. This occurs per edge, so the mean field equations are 

D k = 1 - A k - S k (69) 
S k = -(l + \)keS k , (70a) 

A k = kes k . (70b) 

Here we are using A k and D k for the fraction of nodes with k edges in states A and D, respectively. From 
our above work, it is clear that these equations have an exact solution in terms of an integral. 
For simplicity, let us focus on the case of a scale free graph. We find that 

Sk dSk -(1 + A), (71) 



using conservation of probability, and 



A k dA 
which implies that 



^LJk- (72) 
1 + A v ' 
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This implies that, to good approximation, using z as defined above: 

1 - (7 - 2)zT~ 2 r(2 



1 



1 + A (A:) ^ 
We immediately see that 



kp k A k 



7-2 



1 + A 



r(3-7,z) + 1 -e~ 
1 + A 



m [z 7 



2 r(3-7,z) + l-e" 



(73) 



(74) 



At mean field level, we recognize this as exactly the same as SI epidemic dynamics. This is not an 
accident, and we will explain why this occurs shortly. Our previous analysis implies that T sprea d ~ logiV 
if 7 > 3, ~ log log N if 7 = 3 and ~ 0(1) for 7 < 3. In this case, for large z, the dominant term in the 
dynamics is actually the term 1, so we conclude that T en d ~ log TV" for this model. Figure [8] compares the 
theoretical dynamics of this model to mean field theory, where we see excellent agreement for 7 = 3.5 (for 
short times, at least) and qualitative agreement for 7 = 2.5, but with the simulated z a bit smaller than 
theoretically predicted. We should finally note that for the same reasons as in the type I rumor spreading 
model, Sk, Ik and Rk may be easily recovered from the mean field solution. 
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Figure 8: Comparison of z(t) between theory (solid line) and simulations (dotted line) for the recommen- 
dation spreading model. We used N = 2000, m = 10, and 50 trials. The significant deviations from mean 
field theory for z suddenly increasing are due to finite size. The deviations for z flattening out are due to 
the breakdown of mean field theory discussed below. We have cut off the trajectories once they begin to 
show significant deviations. 

Let us now describe why the dynamics of the recommendation spreading model are, at mean field 
level, SI epidemic dynamics. The answer can be seen by mapping to a simpler problem, in the following 
way. Define i.i.d. random variables X v for each v S V, with X v ~ Bernoulli((l + A) -1 ), and remove 
from the graph G all nodes v with X v = 0. The graph we are left with, which we call G' , can be used to 
understand the t = 00 state of a sample path for the recommendation model, in the following way: G' 
consists of the possible nodes which will become As, if they have the chance to get infected. Now, given a 
set of nodes which are A at t = 0, we conclude that a final state for the dynamics of the recommendation 
spreading model is given by 

{A v not removed, in the same cluster as an initial A 
D v removed, connected to an A . (75) 

S otherwise 

Furthermore, this final state has the same probability of occurring as the sum of all possible configurations 
of the "removed node" model which lead to this same final state. Given these states at t — > 00, we can 
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determine a sample path of the recommendation model by thus treating recommendation spreading as a 
SI epidemic on G' with spreading rate 1. 

This map to the SI epidemic on a reduced graph has a very interesting property, however - it reveals 
that the recommendation spreading model actually has an "epidemic threshold" in the following sense: 
suppose that G' is almost surely a collection of clusters of 0(1) nodes. Then if, at t = 0, an 0(1) number 
of the nodes are A, at t = oo an 0(1) number of nodes are A, implying that there is no recommendation 
"epidemic." A recommendation epidemic can only occur when the the cluster size grows with N. This 
epidemic threshold does not occur within the context of mean field theory, and this is ultimately the 
crucial difference between the recommendation spreading model and the SIR-like models discussed above. 

Given this understanding of the late time dynamics, we now return to Figure In particular (neglect- 
ing the constant factor making mean field theory differ from numerics for 7 = 2.5), we see that for very 
small A, the only divergence from mean field theory is a finite size effect, because the probability that a 
giant cluster would not be present is presumably vanishingly small. However, for larger A, the probability 
that disconnected clusters occur becomes larger, and the value of z at which the dynamics stops suggests 
the frequency with which such clusters occur. For these larger values of A, the dynamics of z therefore 
deviates from mean field theory because the ending state of the dynamics is dependent on the existence 
and frequency of such clusters, and once the dynamics is dependent on graph structure, mean field theory 
breaks down. 

5. Conclusion 



In this paper, we have shown that 4 simple models of irreversible dynamics on networks: the SIR epidemic, 
the SI STD epidemic, type I rumor spreading, and the new recommendation spreading model, have exact 
solutions at mean field level, and that these solutions hold up well in the appropriate regimes against 
numerical tests, differing at most by a constant scaling factor which is not too dramatic O Thus, these 
results provide a far more thorough justification that mean field theory is a valid approximation scheme 
for these models than previous works. Interestingly, proper regularization of divergences which can occur 
on heavy tailed degree distributions, such as those of scale free graphs, proved not only to be necessary 
mathematically, but to provide important physical insights as well. 

Ultimately, the SIR epidemic models, and the type IR or I rumor spreading models, are surely over- 
simplifications for realistic processes (and it is likely that realistic networks have far more structure than 
a simple "mean field" scale free network), so the ultimate relevance of work such as this is to understand 
qualitatively why network structures can lead to dramatic changes in the behavior of stochastic processes. 
Towards this end, knowledge of an exact solution can help to solidify intuition that more heuristic ap- 
proaches give, and can suggest phenomena that heuristic approaches may miss. We have showed that 
the exact solutions of mean field theory, which is often a valid approximation, provide all of the physical 
information of interest (Sfc, If., and R^) other than information dependent on the graph structure. Fi- 
nally, we were able to both provide theoretical explanations for many observed phenomena, as well as to 
postulate some new behaviors and observe them. 

Recent work has mathematically proven some significant deviations from mean field behavior - in 
particular, an absence of an epidemic threshold on scale free graphs of all degrees [7J . While their results 
do not become relevant until N ~ 10 12 , they showed that nonetheless mean field theory can sometimes be 
outright wrong, even on random graphs where physicists are most confident in mean field theory. We hope 
that the (quite likely rare) existence of models whose mean field theory equations have exact solutions on 
arbitrary networks will provide key tests of when and where mean field theory is a valid approximation 

9 Why exactly such scaling factors occur is an open question - part of the reason may be simplifications in the expression 
for 8(z), e.g. 
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for simplified models of realistic networks and processes. Future work should focus on understanding the 
extent to which our techniques may be applied to more complicated models, or other classes of models 
which may admit similar solutions, or focusing more in depth on some of the qualitative arguments we 
made (e.g., if r spre ad ~ 0(1)) which are not readily observable from our basic simulations. 



Acknowledgements 



I would like to thank Daniel Fisher, Greg ver Steeg and Jay Wacker for helpful comments and for encour- 
aging me to continue past my initial calculations. 



References 



[1] M. Abramowitz and LA. Stegun. Handbook of Mathematical Functions (10 th ed., 1972). 

[2] A. Barrat, M. Barthelemy, and A. Vespignani. Dynamical Processes on Complex Networks (2008). 

[3] M. Barthelemy, A. Barrat, R. Pastor-Satorras, and A. Vespignani. "Dynamical patterns of epi- 
demic outbreaks in complex heterogeneous networks", Journal of Theoretical Biology 235 (2005) 
| [cond-mat/0410330] [ 

[4] M. Barthelemy, A. Barrat, R. Pastor-Satorras, and A. Vespignani. "Velocity and hierarchi- 
cal spread of epidemic outbreaks in scale-free networks", Physical Review Letters 92 (2004) 
|[cond-mat/0311501]l 

[5] M. Blattner and M. Medo. "Recommendation systems in the scope of opinion formation: a model", 
I [1206. 3924] [ 

[6] M. Boguha, R. Pastor-Satorras, and A. Vespignani. "Epidemic spreading in complex networks with 
degree correlations", Lecture Notes in Physics 625 (2003) [cond-mat/0301149] . 

[7] S. Chatterjee and R. Durrett. "Contact processes on random graphs with power law degree distri- 
butions have critical value 0" , The Annals of Probability 37 (2009) I [0912 . 1699]] 

[8] S. Gomez, J. Gomez-Gardehes, Y. Moreno, and A. Arenas. "Nonperturbative heterogeneous 
mean-field approach to epidemic spreading in complex networks", Physical Review E84 (2011) 
I [1106. 6184] [ 

[9] J. Gomez-Gardehes, V. Latora, Y. Moreno, and E.V. Profumo. "Spreading of sexually transmitted 
diseases in heterosexual populations", Proceedings of the National Academy of Sciences 105 (2008) 
I [0707. 1672] [ 

[10] H. Khan, R.N. Mohapatra, K. Vajravelu, and S.J. Liao. "The explicit series solution of SIR and SIS 
epidemic models", Applied Mathematics and Computation 215 (2009). 

[11] P.L. Krapivsky and S. Redner. "Organization of growing random networks", Physical Review E63 
(2001) [cond-mat/0011094]l 

[12] J. Libre and C. Vails. "Integrability of a SIS model", Journal of Mathematical Analysis and Appli- 
cations 344 (2008). 



Exact mean field dynamics for epidemic-like processes on heterogeneous networks 



20 



[13] M. Marder. "Dynamics of epidemics on random networks", Physical Review E75 (2007). 

[14] R.M. May and A.L. Lloyd. "Infection dynamics on scale- free networks", Physical Review E64 (2002). 

[15] J.C. Miller. "A note on a paper by Erik Volz: SIR dynamics in random networks", Journal of 
Mathematical Biology 62 (2011) | [0909 . 4485]] 

[16] J.C. Miller, A.C. Slim, and E.M. Volz. "Edge-based compartmental modeling for infectious diseases. 
Part I: An overview" , I [1106 . 6320] \ 

[17] J.C. Miller and E.M. Volz. "Edge-based compartmental modeling for infectious diseases. Part II: 
Model selection and hierarchies", | [11 06 . 6319] . 

[18] J.C. Miller and E.M. Volz. "Edge-based compartmental modeling for infectious diseases. Part III: 
Disease and population structure", [1106.6344] , 

[19] Y. Moreno, J.B. Gomez, and A.F. Pacheco. "Epidemic incidence in correlated complex networks", 
Physical Review E68 (2003) | [cond-mat/0309462][ 

[20] Y. Moreno, M. Nekovee, and A.F. Pacheco. "Dynamics of rumor spreading in complex networks", 
Physical Review E69 (2004) |[cond-mat/0312131][ 

[21] Y. Moreno, M. Nekovee, and A. Vespignani. "Efficiency and reliability of epidemic data dissemination 
in complex networks" , Physical Review E69 (2004) | [cond-mat/0311212"n 

[22] J.D. Murray. Mathematical Biology I: An Introduction (3 rd ed., 2002). 

[23] M. Nekovee, Y. Moreno, G. Bianconi, and M. Marsili. "Theory of rumour spreading in complex 
social networks" , Physica A374 (2007) I [0807 . 1458]] 

[24] M.E.J. Newman. "The spread of epidemic disease on networks", Physical Review E66 (2002) 
| [cond-mat/0205009] [ 

[25] P-A. Noel, B. Davoudi, R.C. Brunham, L.J. Dube, and B. Pourbohloul. "Time evolution of epidemic 
disease on finite and infinite networks" , Physical Review E79 (2009) [0804 . 1807] [ 

[26] M.C. Nucci and P.G.L. Leach. "An integrable SIS model", Journal of Mathematical Analysis and 
Applications 290 (2004). 

[27] R. Pastor-Satorras and A. Vespignani. "Epidemic dynamics in finite size scale-free networks", Phys- 
ical Review E65 (2002) | [cond-mat/0202298][ 

[28] R. Pastor-Satorras and A. Vespignani. "Epidemic spreading in scale-free networks" , Physical Review 
Letters 86 (2001) |[cond-mat/0010317][ 

[29] E. Volz. "SIR dynamics in random networks with heterogeneous connectivity", Journal of Mathe- 
matical Biology 56 (2007) [0705.2092T1 

[30] H.T. Williams, I. Mazilu, and D.A. Mazilu. "Stochastic epidemic-type model with enhanced con- 
nectivity: exact solution", Journal of Statistical Mechanics: Theory and Experiment 2012 (2012) 
|[1108.5135][ 



